Abduction is a kind of logical inference described by Charles Sanders Peirce as "guessing".[1] The term refers to the process of arriving at an explanatory hypothesis. Peirce said that to abduce a hypothetical explanation from an observed surprising circumstance is to surmise that may be true because then would be a matter of course.[2] Thus, to abduce from involves determining that is sufficient (or nearly sufficient), but not necessary, for .
For example, the lawn is wet. But if it rained last night, then it would be unsurprising that the lawn is wet. Therefore, by abductive reasoning, the possibility that it rained last night is reasonable. (But note that Peirce did not remain convinced that a single logical form covers all abduction.)[3]
Peirce argues that good abductive reasoning from P to Q involves not simply a determination that, e.g., Q is sufficient for P, but also that Q is among the most economical explanations for P. Simplification and economy are what call for the 'leap' of abduction.[4]
There has been renewed interest in the subject of abduction in the fields of computer science and artificial intelligence research.[5]
Contents |
In logic, explanation is done from a logical theory representing a domain and a set of observations . Abduction is the process of deriving a set of explanations of according to and picking out one of those explanations. For to be an explanation of according to , it should satisfy two conditions:
In formal logic, and are assumed to be sets of literals. The two conditions for being an explanation of according to theory are formalized as:
Among the possible explanations satisfying these two conditions, some other condition of minimality is usually imposed to avoid irrelevant facts (not contributing to the entailment of ) being included in the explanations. Abduction is then the process that picks out some member of . Criteria for picking out a member representing "the best" explanation include the simplicity, the prior probability, or the explanatory power of the explanation.
A proof theoretical abduction method for first order classical logic based on the sequent calculus and a dual one, based on semantic tableaux (analytic tableaux) have been proposed (Cialdea Mayer & Pirri 1993). The methods are sound and complete and work for full first order logic, without requiring any preliminary reduction of formulae into normal forms. These methods have also been extended to modal logic.
Abductive logic programming is a computational framework that extends normal logic programming with abduction. It separates the theory into two components, one of which is a normal logic program, used to generate by means of backward reasoning, the other of which is a set of integrity constraints, used to filter the set of candidate explanations.
A different formalization of abduction is based on inverting the function that calculates the visible effects of the hypotheses. Formally, we are given a set of hypotheses and a set of manifestations ; they are related by the domain knowledge, represented by a function that takes as an argument a set of hypotheses and gives as a result the corresponding set of manifestations. In other words, for every subset of the hypotheses , their effects are known to be .
Abduction is performed by finding a set such that . In other words, abduction is performed by finding a set of hypotheses such that their effects include all observations .
A common assumption is that the effects of the hypotheses are independent, that is, for every , it holds that . If this condition is met, abduction can be seen as a form of set covering.
Abductive validation is the process of validating a given hypothesis through abductive reasoning. This can also be called reasoning through successive approximation. Under this principle, an explanation is valid if it is the best possible explanation of a set of known data. The best possible explanation is often defined in terms of simplicity and elegance (see Occam's razor). Abductive validation is common practice in hypothesis formation in science; moreover, Peirce argues it is a ubiquitous aspect of thought:
Looking out my window this lovely spring morning I see an azalea in full bloom. No, no! I do not see that; though that is the only way I can describe what I see. That is a proposition, a sentence, a fact; but what I perceive is not proposition, sentence, fact, but only an image, which I make intelligible in part by means of a statement of fact. This statement is abstract; but what I see is concrete. I perform an abduction when I so much as express in a sentence anything I see. The truth is that the whole fabric of our knowledge is one matted felt of pure hypothesis confirmed and refined by induction. Not the smallest advance can be made in knowledge beyond the stage of vacant staring, without making an abduction at every step.[6]
It was Peirce's own maxim that "Facts cannot be explained by a hypothesis more extraordinary than these facts themselves; and of various hypotheses the least extraordinary must be adopted."[7] After obtaining results from an inference procedure, we may be left with multiple assumptions, some of which may be contradictory. Abductive validation is a method for identifying the assumptions that will lead to your goal.
Probabilistic abductive reasoning is a form of abductive validation, and is used extensively in areas where conclusions about possible hypotheses need to be derived, such as for making diagnoses from medical tests. For example, a pharmaceutical company that develops a test for a particular infectious disease will typically determine the reliability of the test by letting a group of infected and a group of non-infected people undergo the test. Assume the statements : "Positive test", : "Negative test", : "Infected", and : "Not infected". The result of these trials will then determine the reliability of the test in terms of its sensitivity and false positive rate . The interpretations of the conditionals are: : "The probability of positive test given infection", and : "The probability of positive test in the absence of infection". The problem with applying these conditionals in a practical setting is that they are expressed in the opposite direction to what the practitioner needs. The conditionals needed for making the diagnosis are: : "The probability of infection given positive test", and : "The probability of infection given negative test". The probability of infection could then have been conditionally deduced as , where "" denotes conditional deduction. Unfortunately the required conditionals are usually not directly available to the medical practitioner, but they can be obtained if the base rate of the infection in the population is known.
The required conditionals can be correctly derived by inverting the available conditionals using Bayes rule. The inverted conditionals are obtained as follows: The term on the right hand side of the equation expresses the base rate of the infection in the population. Similarly, the term expresses the default likelihood of positive test on a random person in the population. In the expressions below and denote the base rates of and its complement respectively, so that e.g. . The full expression for the required conditionals and are then:
The full expression for the conditionally abduced probability of infection in a tested person, expressed as , given the outcome of the test, the base rate of the infection, as well as the test's sensitivity and false positive rate, is then given by: .
Probabilistic abduction can thus be described as a method for inverting conditionals in order to apply probabilistic deduction.
A medical test result is typically considered positive or negative, so when applying the above equation it can be assumed that either (positive) or (negative). In case the patient tests positive, the above equation can be simplified to which will give the correct likelihood that the patient actually is infected.
The Base rate fallacy in medicine,[8] or the Prosecutor's fallacy[9] in legal reasoning, consists of making the erroneous assumption that . While this reasoning error often can produce a relatively good approximation of the correct hypothesis probability value, it can lead to a completely wrong result and wrong conclusion in case the base rate is very low and the reliability of the test is not perfect. An extreme example of the base rate fallacy is to conclude that a male person is pregnant just because he tests positive in a pregnancy test. Obviously, the base rate of male pregnancy is zero, and assuming that the test is not perfect, it would be correct to conclude that the male person is not pregnant.
The expression for probabilistic abduction can be generalised to multinomial cases,[10] i.e., with a state space of multiple and a state space of multiple states .
Subjective logic generalises probabilistic logic by including parameters for uncertainty in the input arguments. Abduction in subjective logic is thus similar to probabilistic abduction described above.[10] The input arguments in subjective logic are composite functions called subjective opinions which can be binomial when the opinion applies to a single proposition or multinomial when it applies to a set of propositions. A multinomial opinion thus applies to a frame (i.e. a state space of exhaustive and mutually disjoint propositions ), and is denoted by the composite function , where is a vector of belief masses over the propositions of , is the uncertainty mass, and is a vector of base rate values over the propositions of . These components satisfy and as well as .
Assume the frames and , the sets of conditional opinions and , the opinion on , and the base rate function on . Based on these parameters, subjective logic provides a method for deriving the set of inverted conditionals and . Using these inverted conditionals, subjective logic also provides a method for deduction. Abduction in subjective logic consists of inverting the conditionals and then applying deduction.
The symbolic notation for conditional abduction is "", and the operator itself is denoted as . The expression for subjective logic abduction is then:[10] .
The advantage of using subjective logic abduction compared to probabilistic abduction is that uncertainty about the probability values of the input arguments can be explicitly expressed and taken into account during the analysis. It is thus possible to perform abductive analysis in the presence of missing or incomplete input evidence, which normally results in degrees of uncertainty in the output conclusions.
The philosopher Charles Sanders Peirce ( /ˈpɜrs/; 1839–1914) introduced abduction into modern logic. Over the years he called such inference hypothesis, abduction, presumption, and retroduction. He considered it a topic in logic as a normative field in philosophy, not in purely formal or mathematical logic, and eventually as a topic also in economics of research.
As two stages of the development, extension, etc., of a hypothesis in scientific inquiry, abduction and induction are often collapsed into one overarching concept — the hypothesis. That is why, in the scientific method pioneered by Galileo and Bacon, the abductive stage of hypothesis formation is conceptualized simply as induction. In the twentieth century this collapse was reinforced by Karl Popper's explication of the hypothetico-deductive model, where the hypothesis is considered to be just "a guess"[11] (in the spirit of Peirce). However, when the formation of a hypothesis is considered the result of a process it becomes clear that this "guess" has already been tried and made more robust in thought as a necessary stage of its acquiring the status of hypothesis. Indeed many abductions are rejected or heavily modified by subsequent abductions before they ever reach this stage.
Before 1900, Peirce treated abduction as the use of a known rule to explain an observation, e.g., it is a known rule that if it rains the grass is wet; so, to explain the fact that the grass is wet; one infers that it has rained. This remains the common use of the term "abduction" in the social sciences and in artificial intelligence.
Peirce consistently characterized it as the kind of inference that originates a hypothesis by concluding in an explanation, though an unassured one, for some very curious or surprising (anomalous) observation stated in a premise. As early as 1865 he wrote that all conceptions of cause and force are reached through hypothetical inference; in the 1900s he wrote that all explanatory content of theories is reached through abduction. In other respects Peirce revised his view of abduction over the years.[12]
In later years his view came to be:
Writing in 1910, Peirce admits that "in almost everything I printed before the beginning of this century I more or less mixed up hypothesis and induction" and he traces the confusion of these two types of reasoning to logicians' too "narrow and formalistic a conception of inference, as necessarily having formulated judgments from its premises."[20]
He started out in the 1860s treating hypothetical inference in a number of ways which he eventually peeled away as inessential or, in some cases, mistaken:
In 1867, in "The Natural Classification of Arguments",[21] hypothetical inference always deals with a cluster of characters (call them P′, P′′, P′′′, etc.) known to occur at least whenever a certain character (M) occurs. (Note that categorical syllogisms have elements traditionally called middles, predicates, and subjects. For example: All men [middle] are mortal [predicate]; Socrates [subject] is a man [middle]; ergo Socrates [subject] is mortal [predicate]". Below, 'M' stands for a middle; 'P' for a predicate; 'S' for a subject. Note also that Peirce held that all deduction can be put into the form of the categorical syllogism Barbara (AAA).)
[Deduction].
[Any] M is P |
Induction.
S′, S′′, S′′′, &c. are taken at random as M's; |
Hypothesis.
Any M is, for instance, P′, P′′, P′′′, &c.; |
In 1878, in "Deduction, Induction, and Hypothesis",[22] there is no longer a need for multiple characters or predicates in order for an inference to be hypothetical, although it is still helpful. Moreover Peirce no longer poses hypothetical inference as concluding in a probable hypothesis. In the forms themselves, it is understood but not explicit that induction involves random selection and that hypothetical inference involves response to a "very curious circumstance". The forms instead emphasize the modes of inference as rearrangements of one another's propositions (without the bracketed hints shown below).
Deduction.
Rule: All the beans from this bag are white. |
Induction.
Case: These beans are [randomly selected] from this bag. |
Hypothesis.
Rule: All the beans from this bag are white. |
Peirce long treated abduction in terms of induction from characters or traits (weighed, not counted like objects), explicitly so in his influential 1883 "A Theory of Probable Inference", in which he returns to involving probability in the hypothetical conclusion.[26] Like "Deduction, Induction, and Hypothesis" in 1878, it was widely read (see the historical books on statistics by Stephen Stigler), unlike his later amendments of his conception of abduction. Today abduction remains most commonly understood as induction from characters and extension of a known rule to cover unexplained circumstances.
In 1902 Peirce wrote that he now regarded the syllogistical forms and the doctrine of extension and comprehension (i.e., objects and characters as referenced by terms), as being less fundamental than he had earlier thought.[27] In 1903 he offered the following form for abduction:[2]
The surprising fact, C, is observed;
- But if A were true, C would be a matter of course,
- Hence, there is reason to suspect that A is true.
The hypothesis is framed, but not asserted, in a premise, then asserted as rationally suspectable in the conclusion. Thus, as in the earlier categorical syllogistic form, the conclusion is formulated from some premise(s). But all the same the hypothesis consists more clearly than ever in a new or outside idea beyond what is known or observed. Induction in a sense goes beyond observations already reported in the premises, but it merely amplifies ideas already known to represent occurrences, or tests an idea supplied by hypothesis; either way it requires previous abductions in order to get such ideas in the first place. Induction seeks facts to test a hypothesis; abduction seeks a hypothesis to account for facts.
Note that the hypothesis ("A") could be of a rule. It need not even be a rule strictly necessitating the surprising observation ("C"), which needs to follow only as a "matter of course"; or the "course" itself could amount to some known rule, merely alluded to, and also not necessarily a rule of strict necessity. In the same year, Peirce wrote that reaching a hypothesis may involve placing a surprising observation under either a newly hypothesized rule or a hypothesized combination of a known rule with a peculiar state of facts, so that the phenomenon would be not surprising but instead either necessarily implied or at least likely.[25]
Peirce did not remain quite convinced about any such form as the categorical syllogistic form or the 1903 form. In 1911, he wrote, "I do not, at present, feel quite convinced that any logical form can be assigned that will cover all "Retroductions". For what I mean by a Retroduction is simply a conjecture which arises in the mind."[3]
Peirce came over the years to divide (philosophical) logic into three departments:
Peirce had, from the start, seen the modes of inference as being coordinated together in scientific inquiry and, by the 1900s, held that hypothetical inference in particular is inadequately treated at the level of critique of arguments.[18][19] To increase the assurance of a hypothetical conclusion, one needs to deduce implications about evidence to be found, predictions which induction can test through observation so as to evaluate the hypothesis. That is Peirce's outline of the scientific method of inquiry, as covered in his inquiry methodology, which includes pragmatism or, as he later called it, pragmaticism, the clarification of ideas in terms of their conceivable implications regarding informed practice.
As early as 1866,[28] Peirce held that:
1. Hypothesis (abductive inference) is inference through an icon (also called a likeness).
2. Induction is inference through an index (a sign by factual connection); a sample is an index of the totality from which it is drawn.
3. Deduction is inference through a symbol (a sign by interpretive habit irrespective of resemblance or connection to its object).
In 1902, Peirce wrote that, in abduction: "It is recognized that the phenomena are like, i.e. constitute an Icon of, a replica of a general conception, or Symbol."[29]
At the critical level Peirce held that the hypothesis should economize explanation for plausibility in terms of the feasible and natural. In 1908 Peirce described this plausibility in some detail.[14] It involves not likeliness based on observations (which is instead the inductive evaluation of a hypothesis), but instead optimal simplicity in the sense of the "facile and natural", as by Galileo's natural light of reason and as distinct from "logical simplicity" (Peirce does not dismiss logical simplicity entirely but sees it in a subordinate role; taken to its logical extreme it would favor adding no explanation to the observation at all). Even a well-prepared mind guesses oftener wrong than right, but our guesses succeed better than random luck at reaching the truth or at least advancing the inquiry, and that indicates to Peirce that they are based in instinctive attunement to nature, an affinity between the mind's processes and the processes of the real, which would account for why appealingly "natural" guesses are the ones that oftenest (or least seldom) succeed; to which Peirce added the argument that such guesses are to be preferred since, without "a natural bent like nature's", people would have no hope of understanding nature. In 1910 Peirce made a three-way distinction between probability, verisimilitude, and plausibility, and defined plausibility with a normative "ought": "By plausibility, I mean the degree to which a theory ought to recommend itself to our belief independently of any kind of evidence other than our instinct urging us to regard it favorably."[30] For Peirce, plausibility does not depend on observed frequencies or probabilities, or on verisimilitude, or even on testability, which is not a question of the critique of the hypothetical inference as an inference, but rather a question of the hypothesis's relation to the inquiry process.
The phrase "inference to the best explanation" (not used by Peirce but often applied to hypothetical inference) is not always understood as referring to the most simple and natural. However, in other senses of "best", such as "standing up best to tests", it is hard to know which is the best explanation to form, since one has not tested it yet.
At the methodeutical level Peirce held that a hypothesis is judged and selected[18] for testing because it offers, via its trial, to expedite and economize the inquiry process itself toward new truths, first of all by being testable and also by further economies,[4] in terms of cost, value, and relationships among guesses (hypotheses). Here, considerations such as probability, absent from the treatment of abduction at the critical level, come into play. For examples:
In 1901 Peirce wrote, "There would be no logic in imposing rules, and saying that they ought to be followed, until it is made out that the purpose of hypothesis requires them."[32] In 1903 Peirce called pragmatism "the logic of abduction" and said that the pragmatic maxim gives the necessary and sufficient logical rule to abduction in general.[19] The pragmatic maxim is: "Consider what effects, that might conceivably have practical bearings, we conceive the object of our conception to have. Then, our conception of these effects is the whole of our conception of the object." It is a method for fruitful clarification of conceptions by equating the meaning of a conception with the conceivable practical implications of its object's conceived effects. Peirce held that that is precisely tailored to abduction's purpose in inquiry, the forming of an idea that could conceivably shape informed conduct. In various writings in the 1900s[4][31] he said that the conduct of abduction (or retroduction) is governed by considerations of economy, belonging in particular to the economics of research. He regarded economics as a normative science whose analytic portion might be part of logical methodeutic.[33]
Norwood Russell Hanson, a philosopher of science, wanted to grasp a logic explaining how scientific discoveries take place. He used Peirce's notion of abduction for this.[34]
Further development of the concept can be found in Peter Lipton's Inference to the Best Explanation (Lipton, 1991).
Applications in artificial intelligence include fault diagnosis, belief revision, and automated planning. The most direct application of abduction is that of automatically detecting faults in systems: given a theory relating faults with their effects and a set of observed effects, abduction can be used to derive sets of faults that are likely to be the cause of the problem.
Abduction can also be used to model automated planning.[35] Given a logical theory relating action occurrences with their effects (for example, a formula of the event calculus), the problem of finding a plan for reaching a state can be modeled as the problem of abducting a set of literals implying that the final state is the goal state.
In intelligence analysis, Analysis of Competing Hypotheses and Bayesian networks, probabilistic abductive reasoning is used extensively. Similarly in medical diagnosis and legal reasoning, the same methods are being used, although there have been many examples of errors, especially caused by the base rate fallacy and the prosecutor's fallacy.
Belief revision, the process of adapting beliefs in view of new information, is another field in which abduction has been applied. The main problem of belief revision is that the new information may be inconsistent with the corpus of beliefs, while the result of the incorporation cannot be inconsistent. This process can be done by the use of abduction: once an explanation for the observation has been found, integrating it does not generate inconsistency. This use of abduction is not straightforward, as adding propositional formulae to other propositional formulae can only make inconsistencies worse. Instead, abduction is done at the level of the ordering of preference of the possible worlds. Preference models use fuzzy logic or utility models.
In the philosophy of science, abduction has been the key inference method to support scientific realism, and much of the debate about scientific realism is focused on whether abduction is an acceptable method of inference.
In historical linguistics, abduction during language acquisition is often taken to be an essential part of processes of language change such as reanalysis and analogy.[36]
In anthropology, Alfred Gell in his influential book Art and Agency defined abduction, (after Eco[37]) as “a case of synthetic inference 'where we find some very curious circumstances, which would be explained by the supposition that it was a case of some general rule, and thereupon adopt that supposition”.[38] Gell criticizes existing 'anthropological' studies of art, for being too preoccupied with aesthetic value and not preoccupied enough with the central anthropological concern of uncovering 'social relationships' specifically the social contexts in which artworks are produced, circulated, and received.[39] Abduction is used as the basis of one gets from art to agency in the sense of a theory of how works of art can inspire a sensus communis, or the commonly-held views that a characteristic of a given society because they are shared by everyone in that society.[40] The question Gell asks in the book is, ‘how does initially to ‘speak’ to people?’ He answers by saying that “No reasonable person could suppose that art-like relations between people and things do not involve at least some form of semiosis.”[38] However, he rejects any intimation that semiosis can be thought of as a language because then he would have to admit to some pre-established existence of the sensus communis that he wants to claim only emerges afterward out of art. Abduction is the answer to this conundrum because the tentative nature of the abduction concept (Pierce likened it to guessing) means that not only can it operate outside of any pre-existing framework, but moreover, it can actually intimate the existence of a framework. As Gell reasons in his analysis, the physical existence of the artwork prompts the viewer to perform an abduction that imbues the artwork with intentionality. A statue of a goddess, for example, in some senses actually becomes the goddess in the mind of the beholder; and represents not only the form of the deity but also her intentions (which are adduced from the feeling of her very presence). Therefore through abduction, Gell claims that art can have the kind of agency that plants the seeds that grow into cultural myths. The power of agency is the power to motivate actions and inspire ultimately the shared understanding that characterizes any given society.[40]
Consequently, to discover is simply to expedite an event that would occur sooner or later, if we had not troubled ourselves to make the discovery. Consequently, the art of discovery is purely a question of economics. The economics of research is, so far as logic is concerned, the leading doctrine with reference to the art of discovery. Consequently, the conduct of abduction, which is chiefly a question of heuretic and is the first question of heuretic, is to be governed by economical considerations.
It allows any flight of imagination, provided this imagination ultimately alights upon a possible practical effect; and thus many hypotheses may seem at first glance to be excluded by the pragmatical maxim that are not really so excluded.
Methodeutic has a special interest in Abduction, or the inference which starts a scientific hypothesis. For it is not sufficient that a hypothesis should be a justifiable one. Any hypothesis which explains the facts is justified critically. But among justifiable hypotheses we have to select that one which is suitable for being tested by experiment.
.... What is good abduction? What should an explanatory hypothesis be to be worthy to rank as a hypothesis? Of course, it must explain the facts. But what other conditions ought it to fulfill to be good? .... Any hypothesis, therefore, may be admissible, in the absence of any special reasons to the contrary, provided it be capable of experimental verification, and only insofar as it is capable of such verification. This is approximately the doctrine of pragmatism.
The mind seeks to bring the facts, as modified by the new discovery, into order; that is, to form a general conception embracing them. In some cases, it does this by an act of generalization. In other cases, no new law is suggested, but only a peculiar state of facts that will "explain" the surprising phenomenon; and a law already known is recognized as applicable to the suggested hypothesis, so that the phenomenon, under that assumption, would not be surprising, but quite likely, or even would be a necessary result. This synthesis suggesting a new conception or hypothesis, is the Abduction.
Thus, twenty skillful hypotheses will ascertain what 200,000 stupid ones might fail to do.